Influence function for robust phylogenetic reconstructions.

نویسندگان

  • Avner Bar-Hen
  • Mahendra Mariadassou
  • Marie-Anne Poursat
  • Philippe Vandenkoornhuyse
چکیده

Based on the computation of the influence function, a tool to measure the impact of each piece of sampled data on the statistical inference of a parameter, we propose to analyze the support of the maximum-likelihood (ML) tree for each site. We provide a new tool for filtering data sets (nucleotides, amino acids, and others) in the context of ML phylogenetic reconstructions. Because different sites support different phylogenic topologies in different ways, outlier sites, that is, sites with a very negative influence value, are important: they can drastically change the topology resulting from the statistical inference. Therefore, these outlier sites must be clearly identified and their effects accounted for before drawing biological conclusions from the inferred tree. A matrix containing 158 fungal terminals all belonging to Chytridiomycota, Zygomycota, and Glomeromycota is analyzed. We show that removing the strongest outlier from the analysis strikingly modifies the ML topology, with a loss of as many as 20% of the internal nodes. As a result, estimating the topology on the filtered data set results in a topology with enhanced bootstrap support. From this analysis, the polyphyletic status of the fungal phyla Chytridiomycota and Zygomycota is reinforced, suggesting the necessity of revisiting the systematics of these fungal groups. We show the ability of influence function to produce new evolution hypotheses.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phylogenetic diversification of the globin gene superfamily in chordates.

Phylogenetic reconstructions provide a means of inferring the branching relationships among members of multigene families that have diversified via successive rounds of gene duplication and divergence. Such reconstructions can illuminate the pathways by which particular expression patterns and protein functions evolved. For example, phylogenetic analyses can reveal cases in which similar expres...

متن کامل

Concordance analysis in mitogenomic phylogenetics.

Here I advocate the utility of Bayesian concordance analysis as a mechanism for exploring the magnitude and source of phylogenetic signal in concatenated mitogenomic phylogenetic studies. While typically applied to the study of independently evolving gene trees, Bayesian concordance analysis can also be applied to linked, but individually analyzed, gene regions using a prior probability that re...

متن کامل

Strong Endemism of Bloom-Forming Tubular Ulva in Indian West Coast, with Description of Ulva paschima Sp. Nov. (Ulvales, Chlorophyta)

Ulva intestinalis and Ulva compressa are two bloom-forming morphologically-cryptic species of green seaweeds widely accepted as cosmopolitan in distribution. Previous studies have shown that these are two distinct species that exhibit great morphological plasticity with changing seawater salinity. Here we present a phylogeographic assessment of tubular Ulva that we considered belonging to this ...

متن کامل

Genome statistics and phylogenetic reconstructions for Southern Hemisphere whelks (Gastropoda: Buccinulidae)

This data article provides genome statistics, phylogenetic networks and trees for a phylogenetic study of Southern Hemisphere Buccinulidae marine snails [1]. We present alternative phylogenetic reconstructions using mitochondrial genomic and 45S nuclear ribosomal cassette DNA sequence data, as well as trees based on short-length DNA sequence data. We also investigate the proportion of variable ...

متن کامل

Measuring genome conservation across taxa: divided strains and united kingdoms

Species evolutionary relationships have traditionally been defined by sequence similarities of phylogenetic marker molecules, recently followed by whole-genome phylogenies based on gene order, average ortholog similarity or gene content. Here, we introduce genome conservation--a novel metric of evolutionary distances between species that simultaneously takes into account, both gene content and ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Molecular biology and evolution

دوره 25 5  شماره 

صفحات  -

تاریخ انتشار 2008